14 research outputs found

    Structuration sémantique des documents XML : Expérimentations et évaluation

    Get PDF
    National audienceLa norme XML permet la représentation d'un document selon un découpage logique qui ne reflète généralement pas la sémantique de son contenu. Il serait donc intéressant de compléter la structure logique des documents XML par une structure sémantique. L'objet de cet article est alors d'évaluer l'approche d'extraction de structures sémantiques pour les documents XML que nous effectuons sur un échantillon de documents issus de la collection ImageClef 2010 tout en utilisant le thésaurus MeSH (Medical Subject Headings)

    Diamond multidimensional model and aggregation operators for document OLAP

    Get PDF
    International audienceOn-Line Analytical Processing (OLAP) has generated methodologies for the analysis of structured data. However, they are not appropriate to handle document content analysis. Because of the fast growing of this type of data, there is a need for new approaches abling to manage textual content of data. Generally, these data exist in XML format. In this context, we propose an approach of construction of our Diamond multidimensional model, which includes semantic dimension to better consider the semantics of textual data In addition, we propose new aggregation operators for textual data in OLAP environment

    OLAP4Tweets: Multidimensional Modeling of tweets

    Get PDF
    International audienceTwitter, a popular microblogging platform, is at the epicenter of the social media explosion, with millions of users being able to create and publish short posts, referred to as tweets, in real time. The application of the OLAP (On-Line Analytical Processing) on large volumes of tweets is a challenge that would allow the extraction of information (especially knowledge) such as user behavior, new emerging issues, trends… In this paper, we pursue a goal of providing a generic multidimensional model dedicated to the OLAP of tweets. The proposed model reflects on some specifics such as recursive references between tweets and calculated attributes

    A novel multidimensional model for the OLAP on documents : modeling, generation and implementation

    Get PDF
    International audienceAs the amount of textual information grows explosively in various kinds of business systems, it becomes more and more essential to analyze both structured data and unstructured textual data simultaneously. However information contained in non structured data (documents and so on) is only partially used in business intelligence (BI). Indeed On-Line Analytical Processing (OLAP) cubes which are the main support of BI analysis in decision support systems have focused on structured data. This is the reason why OLAP is being extended to unstructured textual data. In this paper we introduce the innovative “Diamond” multidimensional model that will serve as a basis for semantic OLAP on XML documents and then we describe the meta modeling, generation and implementation of a the Diamond multidimensional model

    Modèle multidimensionnel en diamant dédié à l'OLAP sémantique de documents

    Get PDF
    National audienceLe document électronique représente aujourd’hui un support d’information que les entreprises ne peuvent plus négliger si elles veulent être certaines d’identifier et de gérer toutes les données qui leur sont utiles au quotidien. Plusieurs travaux ont proposé l’application des techniques OLAP (« On-line Analytical Processing ») aux informations documentaires. Dans cet article, nous présentons un nouveau modèle multidimensionnel dédié à l’OLAP de documents. Ce modèle, dit en diamant, est organisé autour d’une dimension centrale qui traduit la sémantique du contenu textuel du document

    Modeling and OLAPing social media : the case of Twitter

    Get PDF
    In the recent year, social networks have revolutionized the ways of interacting and exchanging information on the Internet. Millions of users interact frequently and share variety of digital content with each other. They express their feelings and opinions on every topic of interest. These opinions carry import value for personal, academic, and commercial applications, but the volume and the speed at which these are produced make it a challenging task for researchers and the underlying technologies to provide useful insights into such data. We attempt to extend the established online analytical processing (OLAP) technology to allow multidimensional analysis of social media data. In this paper, we pursue a goal of providing a generic multidimensional model dedicated to the OLAP of social media and specially Twitter. The proposed model reflects on some specifics such as recursive references between tweets, Empty dimension, and different types of hierarchies. It is implemented using NetBeans IDE platform. We present also some experimental results. We expect our proposed approach to be applicable for analyzing the data of other social networks as well

    Science, technology and innovation policy evaluation: An isolated academic and practice field

    Get PDF
    Trabajo presentado a la EU-SPRI Conference: "Science and Innovation Policy: Dynamics, Challenges, Responsibility and Practice", celebrada en Manchester (UK) del 18 al 20 de junio de 2014.Peer Reviewe

    EntrepĂ´ts de documents (de l'alimentation Ă  l'exploitation)

    No full text
    TOULOUSE3-BU Sciences (315552104) / SudocSudocFranceF
    corecore